Temporal Reachability Graphs 



John Whitbeck+t Marcelo Dias de Amorimt Vania Conan* 

john.whitbeck@lip6.fr marcelo.amorim@lip6.fr vania.conan@fr.thalesgroup.com 

Jean-Loup Guillaumet 
jean-loup.guillaume@lip6.fr 



f UPMC Sorbonne Universites 
4, place Jussieu 
75005 Paris 
France 



J Thales Communications & Security 
160 Boulevard de Valmy 
92700 Colombes 
France 



ABSTRACT 

While a natural fit for modeling and understanding mobile 
networks, time-varying graphs remain poorly understood. 
Indeed, many of the usual concepts of static graphs have 
no obvious counterpart in time-varying ones. In this paper, 
we introduce the notion of temporal reachability graphs. A 
(t, 5) -reachability graph is a time-varying directed graph de- 
rived from an existing connectivity graph. An edge exists 
from one node to another in the reachability graph at time 
t if there exists a journey (i.e., a spatiotemporal path) in 
the connectivity graph from the first node to the second, 
leaving after t, with a positive edge traversal time r, and 
arriving within a maximum delay S. We make three contri- 
butions. First, we develop the theoretical framework around 
temporal reachability graphs. Second, we harness our the- 
oretical findings to propose an algorithm for their efficient 
computation. Finally, we demonstrate the analytic power 
of the temporal reachability graph concept by applying it 
to synthetic and real-life datasets. On top of defining clear 
upper bounds on communication capabilities, reachability 
graphs highlight asymmetric communication opportunities 
and offloading potential. 
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Figure 1: Snapshots of a time-varying graph with 
five nodes. 
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1. INTRODUCTION 

In time- varying graphs (TVG), vertices and edges appear 
and disappear as a function of time. Alternatively called 
temporal networks [15] or evolving graphs [29], time- varying 
graphs have emerged over the past few years as a key model 
for a variety of complex systems |10| |13| |16| |26[ |27| . Time- 
varying graphs can also serve as a solid theoretical frame- 
work for investigating fundamental properties of mobile net- 
works [3] [9]. In Fig.Q we show an example of a time- varying 
graph representing a five-node mobile network. 

Many of the typical concepts of static graphs such as 
paths, distance, diameter, or node degree have no obvious 
counterpart in time- varying graphs. Theorems that are true 
on static graphs may not hold in time-varying ones [15] 
and dynamic equivalents of well-known problems on static 
graphs, such as finding strongly connected components, turn 
out to be intractable [21. Naturally, it is always possible 
to track the evolution of static metrics such as node degree 
or clustering on snapshots of the time- varying graph at cer- 
tain time intervals [6] but, while instructive, this approach 
often fails to capture the correlations between successive 
snapshots 14 . Recently a number of concepts specific to 
time- varying graphs have been studied such as journeys [29] , 
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temporal diameter [Bl, or reachability time 

In the context of opportunistic mobile networking, much 
attention has been focused on aggregate inter-contact time 
distributions. These have been found to fit a power-law 
(with an a parameter smaller than 1) followed by an expo- 
nential cutoff in a number of real-life datasets [3. Should this 
power-law also hold for pairwise inter-contact times, then 
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Table 1: Notations used in this paper 



Figure 2: Temporal reachability graphs obtained 
from the time-varying graph of Fig. [l] for <5=ls 
and 2s. Arrows indicate uni-directional communi- 
cation opportunisties, while their absence indicates 
bi-directional ones. We show snapshots for different 
times t. We assume in this example that one-hop 
communications require r=ls. 



the expected values of these would be infinite, a pessimistic 
result indeed. However, further work has shown that aggre- 
gate power-law distributions can emerge from a diversity of 
pairwise inter-contact laws (including exponential) [7[ [22] . 
Moreover, all information on multi-hop connectivity is ig- 
nored. Inter-contact time measurements do not therefore 
suffice to characterize connectivity in mobile networks. 

Temporal reachability graphs (TRG), our contribution, on 
the other hand, offer an immediate view of communication 
possibilities in a dynamic network. Given a time-varying 
graph Q, an arc exists from vertex a to vertex 6 at time t in 
its derived (t, 5) -reachability graph if a journey exists in Q 
from a to & leaving a after time t and arriving at b before 
t + 5, given that each single-hop communication takes time r. 
Here, r is the edge traversal time and 5 is the upper bound 
on journey times in Q. By definition, temporal reachability 
are directed time-varying graphs. For example, in the con- 
text of studying information dissemination in delay-tolerant 
networks (DTN) [ll], r would be the one- hop transmission 
delay and 5 is the maximum tolerated delay. 

In Fig. [2] we show the instances of the temporal-reachability 
graph extracted from the opportunistic mobile networks of 
Fig. [l] at different times (assuming that the graph does not 
change between the snapshots shown in Fig.[T|. We consider 
in this example that the link traversal time is r=ls. When 
5 = Is, the only possible journeys in Q at t £ {Is, 2s, 3s} are 
the direct links between nodes - communications tolerate 
a delay of Is, which is the time to traverse one single link. 
When 8 = 2s, several more journeys appear. Consider for 
instance the reachability graph at t = Os. There is a directed 
link between a and d as a can wait one time unit for the link 
(6, d) to appear (a ~> b and & ~> d); thus, a two-hop journey 
a d through b becomes possible. 

We make three main contributions: 

Temporal reachability graphs. We formalize the concept 
of reachability graphs. In particular, under conditions that 
hold for all empirical datasets, we prove an additive prop- 
erty on the delay of reachability graphs. Roughly speaking, 
knowledge of the reachability graphs for delays S and \i is 
enough to derive the reachability graph for delay S + ji. 
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Algorithm for efficient computation of TRG. We trans- 
late this additive property into an efficient single-pass stream- 
ing algorithm [T] to calculate entire families of reachability 
graphs. 

Insights into real-world mobility traces. Once calcu- 
lated, reachability graphs yield many original insights on the 
temporal structure of the time-varying graphs they derive 
from. In this paper, we calculate the reachability graphs 
of several synthetic and real-life high-resolution connectiv- 
ity traces [l8 20 23, 25 . These highlight the concepts of 
temporal connectivity, temporal asymmetry, and temporal 
dominating set. From the point of view of opportunistic 
communications, reachability graphs immediately provide, 
at all times, the maximum delivery ratio and the minimum 
set of users required to broadcast a message to the entire 
network, thereby offering a complete view of opportunistic 
communication capabilities and offloading potential. 

The rest of the paper is structured as follows. Sections [2] 
and [3] introduce the concepts of time- varying and reachability 
graphs. Their theoretical properties justify their computation 
by a streaming algorithm detailed in Section[4] Section[5]then 
analyzes the reachability graphs of synthetic and real-life 
connectivity graphs. Related work is presented in Section [6] 
Finally, Section [7] concludes. 

2. TIME-VARYING GRAPHS 

Time- varying graphs are a very useful high-level abstrac- 
tion for studying connectivity over time in mobile network. 
Terms, notations, and definitions around time- varying graphs 
vary considerably, but recently, Casteigts et al. have proposed 
a unified framework for TVGs and, wherever applicable, we 
will use their definitions and notations [3]. 

Definition 1. (Time- varying graph) Let V be a set of 

vertices, and E C- V x V the set of possible edges between 
vertices in V. Events occur over a time span TCT, where 
T is the temporal domain (N for discrete-time systems or R+ 
for continuous-time ones). In the general case, a TVG is a 
tuple G = (V, E, T, p, C) where p : E x T ->■ {0, 1}, called 
presence function, indicates whether a given edge exists at 
a given time and C, : E x T — > T, called latency function, 
indicates the time it takes to cross a given edge if starting at 
a given date (the latency of an edge could vary in time). 



In this paper, we will mostly consider continuous-time 
TVGs (i.e., T = R+). Furthermore, we will always assume 
a constant £ function such that V(e, t) G E x T, C(e, t) = t 
where r > is our uniform edge traversal time. We also 
note Q(t) C E the set of edges in the snapshot of Q at time 
t. Hence e € Q(t) p{e,t) — 1. Finally, given an edge 
e = (u, v) G E, we define from(e) = u and to(e) = v. 

Definition 2. (Inclusion) Let Q and Q' be two TVGs that 
differ only by their presence functions p and p' . We write 
G C g' if and only if Vt G T,Ve G E,p(e,t) < p'(e,t), or 
equivalently if and only if Vt G 7", g(t) C g'(t). 

Definition 3. (Union) Let (J, 'H, and X be three TVGs 
that differ only by their presence functions. We write g = 
■HUlif and only if Vt G T,Q{t) = U(t)U X{t). 

Definition 4- (Journey) A journey in g is a sequence of 
couples J — {(ei,ii), (e2,ta), • ■ ■ , (ek,tk)} such that Vi < k: 
(i) to(e.i) = from(e;+i); (ii) Vt s.t. ti <t < f,: + r, p(ei,t) = 1; 
and (hi) ti+i > ti + r. Here, \ J\ = k is this journey's 
topological length (i.e., the number of hops). Furthermore, 
departure^) and arrival^) denote the starting date ti and 
the last date tk + r, respectively. Finally 5j = arrival^) — 
departure( e 7) is the journey's temporal length. A journey may 
represent, for example, the sequence of hops that a message 
follows through an opportunistic network. 

3. TEMPORAL REACHABILITY GRAPHS 

Now that we have the necessary background, let us formally 
define the notion of temporal reachability graphs (or simply 
reachability graphs in the remainder of this paper, for the 
sake of readability). Furthermore, we write N* for N \ {0}. 
To the best of our knowledge, the definitions and results 
presented in this section are completely novel. 

3.1 Reachability Graphs 

Definition 5. (Reachability graph) For 5 G R+, let TZg 

be the reachability graph with maximum delay 5 derived from 
g (with edge traversal time r). Formally, Vt, (u,v) G lZs(t) if 
and only if u ^ v and there exists at time t in g a journey J 
from u to v such that departure( e 7) > t and arrival(,y) < t + 5. 

Note that if 5 < r then TZs is empty, as even one- hop jour- 
neys do not have enough time to arrive before the maximum 
delay. 

Proposition 1 
ability graphs of g . 



Theorem 1 (Decomposition). Let g be a TVG with 
edge traversal time r and TZ = {7Zs}s>o the set of all its 
reachability graphs. Let 5 > and p > r. Then: 



(Growth). Let IZs andlZ^ be two reach- 
Then 5 < n => TZs^TZ^. 



This follows naturally from the definition of a reachability 
graph. Note that the reverse is not true. For example let 
g be a TVG with edge traversal time r > 0. Its derived 
reachability graphs TZ T /2 an <A 1Z T /3 are both empty as their 
maximum delays are smaller than the time it takes to cross 
one edge. Therefore, as per definition [2] 1Z T /2 C 1Z T /- A even 
though | > §• 

Definition 6. (Composition) Let IZs and TZ^ be two 
reachability graphs of g. We define their composition TZs'SiRii 
as the TVG such that, at all times t, 



(u,v) G (TZ S ®ft M ) (t) <^ < 



(u,v) G 1Zs(t) 
(u,v) G ft M (t- 
3w G V, (u, w) 
(w, v) G IZ^ 



F<5), or 
G TZ s (t) and 
(t + 5) 



TZ, 



U 

0<£<1 



While not yet in a calculable form, this theorem is at the 
heart of our approach. Intuitively, it states that if all the 
reachability graphs with delays close to S and p, are known, 
then one can calculate the reachability graph with delay 5 + [i. 
A full formal proof is provided in the appendix (Section [aTTJ, 
but intuitively, it relies on the following simple idea. Any arc 
in TZs+fi corresponds to a journey in g starting after time t 
and arriving before t + S + ji. Either it can be neatly divided 
into a (possibly empty) journey in IZs starting after t and 
arriving before t + 5 and another (possibly empty) journey 
in TZfj, starting after t + 5 and arriving before t + 5 + p,, or it 
starts crossing an edge before t + 5 but after t + 5 — t that 
straddles both time intervals. In the latter case, since the 
edge traversal time is r, one can incorporate the straddling 
edge into a journey arriving before t + 5 + er and the rest into 
another journey leaving after t + 5 + et of temporal length 
less than p — et. 

3.2 Regular reachability graphs 

Real-life datasets all have a maximum resolution (e.g., a 
second or a millisecond) that corresponds to the precision 
with which they were measured. While it is tempting to map 
their time domain T to N, in reality one must account for 
0-second edge durations. For example, if the edge traversal 
time t is null, then even ephemeral edges that last seconds 
may be part of a journey. In the more common situation 
where r is strictly positive, one may still encounter 0-duration 
arcs in a reachability graph. For example, consider an edge 
(u, v) that is present for only one second. If r and 5 are both 
also equal to one second, then a one-hop journey from u to 
v using that edge only exists at precisely the instant t that 
the edge appears. This, in turn, corresponds to two 0-second 
arcs in 1Zs(t): (u,v) and (v,u). Note that both of these arcs 
will become one-second-long arcs in the reachability graph 
when 5 = 2s. These observations lead to the definition of 
regular time-varying graphs. 

Definition 7. (Regular TVG) A TVG g is an ^-regular 
TVG if there exists rj > such that Vfc G N, kn < ti < t 2 < 
(k + l)r? => Q(ti) = G(t 2 ) C g(kn). Here, n is called £'s 
resolution. The time interval [km (k + l)n[ is the k th epoch 
of g with starting time kn and ending time (k + l)n. 

Without loss of generality, this definition assumes that the 
first epoch starts at t = 0. Intuitively, an 77-regular TVG 
is one whose instantaneous graph topology cannot change 
arbitrarily quickly, as, during each epoch, the graph topology 
remains constant. Ephemeral 0-second edges or arcs may 
exist at the start of an epoch, but the TVG then remains 
constant until the start of the next epoch. Not only are 
regular TVGs a natural fit for real-life traces, but, as we will 
see, they also represent a class of TVGs whose reachability 
graphs are calculable. 

Theorem 2 (Regular reachability graphs). Letg be 
an n-regular TVG whose edge traversal time is r G r/N* . For 
5 G ??N, let TZg be a reachability graph of g . Then TZs is an 
n-regular TVG and Vfe G N, kn < t < (k + l)n =4> TZsit) C 
TZs{kn)mZ s {{k + l)n). 



This theorem ensures that, as long as 8 is a "multiple" of 
the resolution 77, TZs carries on the ^-regularity of Q. The 
full formal proof is a little technical (Section |A.2| in the 
appendix), but it relies on the following idea. Let us consider 
a one-hop journey in an 77-regular TVG Q leaving in the 
middle of an epoch at time t. Let k be the integer such 
that kij < t < (k + l)r]. It arrives in the next epoch at 
time (k + l)n < t + t < (k + 2)n (the theorem assumes that 
r > 77). Because Q is 77-regular, the departure time of this 
one-journey can be nudged forwards or backwards as long as 
it remains with the same epoch. Hence all one-hop journeys 
with a departure time t' such that kn < t' < (k + 1)77 are 
also valid journeys in Q. This "nudging" can be extended, 
though not trivially, to multi-hop journeys which proves the 
theorem. 

Furthermore, the r > 77 hypothesis guarantees that an arc 
present in a reachability graph during an epoch is present 
not only at the start of the epoch like in any regular graphs 
both also at the start of the next epoch. This property 
will be leveraged later in this section. However TZg(t) 7^ 
TZs(krj) (~1 TZs ((k + 1)77). An example of such a situation is 
discussed in Section [3.31 



Theorem 3 (Sampling). Let Q be an n-regular TVG 
whose edge traversal time is r = 7177 with n G N* . Let TZ = 
{7ti7]}i£N be the set of all its derived n-regular reachability 
graphs. For (d, m) £ N x N such that m > n, we have, 
Va G N: 



7Z-(d+m)»j(0'7) — [J {7Z{d+k)-q 
0<k<n 



TZ 



-*)»). 



(a??). 



This theorem is the adaptation of Theorem [T] to regular 
TVGs. The formal proof is in the appendix (Section A. 3 1 
but, as previously, it relies on the idea that within an epoch, 
there exists a little freedom to "nudge" journey departure 
and arrival times backwards or forwards. In a sense, if both 
the minimum departure time and maximal arrival time of 
a journey fall exactly at the start of a time epoch, then 
this journey can be divided into two sub-journeys whose 
departure and arrival times also map exactly to the start of 
epochs. Under this form, the value of a reachability graph 
at the start of each epoch can be exactly calculated from the 
values, at the start of each epoch, of the proper set of r pairs 
of reachability graphs. 

Unfortunately, this exact formula does not extend to the 
time spent strictly within an epoch. In a way, the state of the 
reachability graph during epochs is more important than its 
state at their starting times. Indeed, the time spent at exactly 
the start of epochs is infinitesimally small. Therefore any 
metric averaged over time (e.g. average density) will depend 
only on the states strictly within epochs. In the next section, 
we propose an upper and a lower bound on the reachability 
graph during epochs. As we will later see in Section [5] these 
upper and lower bounds are in fact nearly always equal 
and therefore achieve an excellent approximation of the real 
reachability graph. 

3.3 Upper and lower bounds 

We define for every regular reachability graph TZs an upper 
and lower bound, i.e., two TVGs Us and Cs such that Cs C 
TZs C Us ■ Both of these bounds are equal to the reachability 
graph at the start of each epoch but only differ slightly during 
the epoch. The upper bound is a straightforward application 
of Theorem H 



Proposition 2 (Upper Epoch approximation). Let 

Q be an n-regular TVG whose edge traversal time is t G 77N*. 
For 8 e 77N, let TZs be a reachability graph of Q . We de- 
fine TZs 's upper approximation, the TVG Us, as follows 
(Van <t<(a+l)n with a G N): 

• if t = (177, then Us{an) = TZs(an); 

• ift> an, then Us(t) = TZs(an) n TZs {(a + l)n). 
As defined, V<5, TZs C Us ■ 

The lower bound is a little more complicated and involves 
a modified version of the composition operator (Definition [61. 

Definition 8. (Approximate composition) Let TZs and 

TZ^ be two reachability graphs of an 77-regular TVG Q with 
8 and fj, in 77N*. Let Cs and £ M be two TVGs such that 
Cs C TZs and C TZ^. For all times an < t < (a + l)n with 
a G N, we define their approximate composition Cs C^ as 
follows: 

• if t = a??, tnen (£<5 © AO ( ar l) = (R-s Hp) (an); 

• if t > an, then 



(u,v) G (CsQCp) (t) & < 



(u, v) G TZs ((a + 1)77) , or 
(u, v) G TZp(ar} + 8), or 
3u> G V, (u,w) G Cs{t) and 
(w,v) G C^t + 8) 



By definition, at the start of each epoch the approxi- 
mate composition is equal to the regular composition of 
the reachability graphs. During an epoch, the condition 
(u, v) G TZs {{ a + I)*?) is easier to meet than the more in- 
tuitive (u,v) G Csit). Indeed, with Theorem [2] Csit) C 
TZs(t) C TZs ((a + 1)77), and this composition operator will 
catch the arcs (including the 0-second ones) occurring at 
the start of the next epoch. The same is true for the 
(u, v) G TZpiar) + 8) condition for arcs ending at the start 
of the current (5-shifted) epoch. Combined, these more 
inclusive conditions make for a tighter lower bound. 

Proposition 3 (Lower Epoch approximation). Let 

Q be an n-regular TVG whose edge traversal time is T — nn 
with n G N*. Let TZ — {TZs}ssv^ be the set of all its de- 
rived reachability graphs. We recursively define TZs 's lower 
approximation (i.e., the TVG Cs) as follows. Ford < 2n, 
dr, = TZdri and for all [d, m) in N x N such that d> n and 
771 > n, 

£-{d+m)r] — £-{d+k)-q © £(m-k)r)- 

0<k<n 

As defined, Vd G N, C dv C TZ dv ■ 

The proof, detailed in Section |A.4| of the appendix, is 
fairly straightforward and involves setting the value of e in 
Theorem [T] as a function of 77, t, k, and n. 

This recursive definition means that the exact value of Cs 
during a time epoch depends on the sequence of compositions 
that was used to calculate it. For example, in the simple 
situation where r = 77 = 1, the lower bounds £4 = C2 © Ci 
and C' 4 = C3 Ci may be different. Proposition [3] only 
guarantees that they are both included in the real reachability 
graph TZ4. However, as we will see, this is not a concern 
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Figure 3: Upper epoch approximation with t = r\ — 
1. Arrows point in the direction of time. Intervals 
denote the presence of an edge or arc during that 
time. Dots are intervals reduced to a single point in 
time. In this example the lower bound £2 is equal 
to the reachability graph 722 but the upper bound 
U2 is not. 
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Figure 4: Lower epoch approximation with r = 77 = 
1. Arrows point in the direction of time. Intervals 
denote the presence of an edge or arc during that 
time. Dots are intervals reduced to a single point in 
time. In this example the upper bound W4 is equal 
to the reachability graph 72-4 but the lower bound £4 
is not. 



because this lower bound is tight regardless of how it is 
calculated. 

Fig. [3] is an example of a situation with four vertices where 
the upper bound has an arc during an epoch that is not 
in the reachability graph. Here r = 77 = Is. 72i is exactly 
derived from Q by shortening the end time of each edge 
by t. For example, this leads to an ephemeral arc from a 
to b and a two-second-long arc from b to d (down from a 
three-second-long edge (b,d) in Q). Obviously, not all arcs 
are represented on Fig. [3] In 72,2, there are two ephemeral 
arcs from a to d, corresponding to the a b ~> d journey, 
and the a ~» c~» d journey an epoch later. In between those 
instants, it is both too late to use the first journey and too 
early to use the second. Since this arc exists at the start of 
two successive epochs, U2 errs in considering that it exists 
during the entire epoch. In this example, the lower bound 
£2 matches the reachability graph. 

Conversely, Fig. [4] is an example of a situation with three 
vertices where the lower bound misses an arc during an 



epoch that is present in the reachability graph. Here again 
t — rj = 1. The one-second-long edges in Q all become 
ephemeral 0-second arcs in 72. 1 (not shown on the figure), 
that in turn become one-second-long arcs in 72.2. Furthermore, 
722 also contains two ephemeral arcs (a, c) and (6, d) that 
correspond to the a ~> b ~> c and b ~» c ~> d journeys 
respectively. Here 722 = £2 = U%. Finally 724 contains an 
arc (a, d) that corresponds to the existence of the three-hop 
journey a ~» b ~> c ~> d. £4 correctly identifies this journey 
at two instants k and k + 1. At time k, it composes arc (a, b) 
from 722 (fe) with the ephemeral arc (6, d) in 722 (k + 2). At 
time k + 1 it composes the ephemeral arc (a, c) in 722 (k + 1) 
with the arc (c, d) in 722(fc + 3). However, for k < t < k + 1, 
£4 has no way of finding the journey from a to 6. Note that 
in this case, VI4 is equal to 724. 

The example of Fig. [4] also helps us understand the ab- 
sence of error propagation when calculating successive lower 
approximations from previous lower approximations. In this 
example, thanks to the conditions in Definition [8] that refer 
to the starting times of epochs, £s will "bridge" over the 
missing (a,d) arc in £4. However it may miss composed 
arcs that use the (a,d) arc in 724. These missed arcs will 
be, in-turn, "bridged" in, for instance, a Lia approximation. 
Roughly speaking, errors during epochs do not propagate 
beyond two compositions. Combined with the fact that the 
lower approximation matches the reachability graph at the 
start of each epoch, this leads, as we will verify empirically, 
to a tight lower approximation or the reachability graph. 

3.4 A few case studies 

In this section, we examine how the theoretical results 
above apply to some simple situations. 

Zero edge traversal time. The reachability graphs of 
77-regular TVGs with zero edge traversal time (r = 0) are 
very easy to calculate. In particular (u,v) € 72o(f) if and 
only if u and v are in the same connected component at time 
t. Similarly, 72,, (i) can be calculated from the connected 
components at time t and t + 77. Furthermore, Theorem [l] 
becomes: 72^+^ = 1Zs ®72 M . This relation can then be applied 
repeatedly (e.g., in a binary-exponentiation) to obtain any 
72i with S G rjN* . Furthermore, the reachability graphs thus 
calculated are exact. However, in this specific case where 
the edge traversal time is null, more efficient algorithms 
for calculating reachability graphs exist. For example the 
algorithm proposed by Chaintreau et al. for computing delay- 
optimal paths could be easily adapted to this purpose [5]. 

Several hops per epoch. While all of the results on regular 
reachability graphs in Sections |3.2| and |3.3| assume that the 
edge traversal time r is in rjW , they can be simply adapted 
to the case where r = 77/n with n € N*, i.e., when up to 
n edges may be crossed during a single epoch. Indeed, in 
this case, any 77-regular graph would also be r-regular, and 
we can apply all of our results as if r = 77. For example, 
Proposition [3] is reduced to £( d+m )^ = £d,, £ m ,,. 

Unit delay. For r > 0, the reachability graph 72 T is trivially 
calculable from Q. Indeed, for any edge (u,v) in Q that 
appears at time ii and disappears at time £2 > ii + T, the 
arcs (u, v) and (v, u) appear in 72 T at time £1 and disappear 
at time £2 — T. This derivation is simple but essential for 
bootstrapping iterations of lower bound compositions (e.g., 
repeated applications of Proposition [3| . 




Figure 5: Adding £4 and £g families to obtain an 
£,4® £-8 = £12 family. In this example r = 2 and 77 = 1. 

4. EFFICIENT COMPUTATION OF REACH- 
ABILITY GRAPHS 

4.1 Families of reachability graphs 

Another interesting property of lower approximations of 
reachability graphs is that an upper approximation may be 
derived from it. Indeed, since it contains the exact values of 
the reachability graph at the start of each epoch, it is trivial 
to calculate their intersection during each epoch. While the 
method described in this section focuses on the efficient com- 
putation of the lower approximations of reachability graphs, 
it simultaneously computes the upper approximations. 

At a high level, the algorithm presented in this section 
is a binary exponentiation on families of lower bounds of 
reachability graphs using a special additive operator. To 
simplify notations, we will consider in this section that n = 1 
and r G N*. 

Definition 9. (Lower bound family) Let Q be a 1-regular 
TVG whose edge traversal time is t £ N*. For d G tN, let 
Cd be the lower approximation of a reachability graph IZd 
of Q. For d > r, we define Cd, the family of Cd such that 
C d = {C d +i}_ T<i<T - 

Proposition 4 (Family additivity). Let Q be a 1-regular 
TVG whose edge traversal time is t G N* . Let TZd and lZ m be 
two reachability graphs of Q such that d G rN* and m G rN* , 
and Cd and C m the respective families of their lower ap- 
proximations. We define Cd ffi C m the set of TVGs such 
that 

Cd ffi C m — < Un<t<-T C-d+k © C m + i-k \ 

V U ^ K ^ T J 0<i<T 

U \ Uo<fc<T £d+i + k Cm-k \ 

K — J — T<1<0 

Then (i) any element Cd+m+i G Cd® C m with — r < i < t 
is a lower approximation oflZd+m+i, and (ii) any TVG in 
Cd+m can be calculated from t compositions of pairs of TVGs 
in Cd x Cm ■ 

Proof. Consider any lower approximation Cd+m+i £ 

Cd+m- K — T < i < 0, Cd+m+i = C(d+i) + m — 

U <fc<r Cd+t+kOCm-k (Theorem[3|. If < i < r, C d +m+t = 
£ d+ ( m+ j) = Uo<fe< T £<i+k Cm+i-k (Theorem^. □ 

We now have our self-sufficient elements, the lower bound 
families, and an additive operation © between them. For 



illustration, Fig. [5] details the process of adding £4 and Cs 
families to obtain an £4 ffi Cs = £12 family. The £3, £4, 
£5, £7, Cs, and £g lower bounds are combined as inputs for 
three applications of Proposition [3] that yield £n, £12, and 
£13. Viewed as a black box, this operation combines the £4 
and £8 families into an £12 family. 

The inner workings of the ffi operator are embarrassingly 
parallel. Indeed, each application of Proposition [3] can be 
run completely independently of the others. This opens the 
way for highly distributed implementations, whose speed 
will be determined by that of the composition of r pairs of 
lower bounds of reachability graphs. Accordingly, the next 
section proposes an efficient algorithm for this composition 
operation. 

4.2 Composing reachability graphs 

A time- varying graph Q may be stored as a time-indexed 
sequence of edge UP and DOWN events. For example, if at 
time t an event {(u,v), DOWN} occurs, then the edge (u,v) 
disappears at time t in Q{i). Such a representation is well 
suited for algorithms that sequentially examine all states of 
the TVG. 

We present a streaming algorithm for composing r reach- 
ability graph lower bounds as in Proposition [3] Streaming 
algorithms are well suited to TVGs as their memory require- 
ments do not depend on the duration of the trace but only 
on the number of vertices [l]. Memory is indeed a limited 
resource, as reachability graphs can become fully connected 
cliques and data structures such as adjacency matrices can- 
not therefore be considered sparse. Furthermore, in our case, 
a streaming algorithm facilitates a parallel implementation 
of the ffi operator as one process can read the input families, 
duplicate their events, and dispatch these to various workers, 
each calculating one composition operation. 

Algorithm [l] reads its input from r pairs of event streams 
{Cd+k,C m -k} and writes C d + m = U <fc<r ^-i+k C m -k to 
its output stream. It makes use of an arc counter that tracks 
how many of the conditions in the definitions of Cd+k 
Cm-k (see Definition [sj) are verified by each arc. When 
a previously down link fulfills one of these conditions, its 
counter is initialized to 1 and an UP event is written to the 
output stream (Line [5}. When this counter goes to the arc 
is removed from Cd+m. and a DOWN event is written to the 
output stream (Line [7]). Note that an arc may be brought 
up and down at the same time i if its counter goes to right 
after it appears (i.e., an ephemeral arc). In this case, the arc 
triggers both lines [5] and [jj 

In more detail, this algorithm directly maps to the three 
conditions in the definition of the approximate composition 
(Definition [8|: 

Condition 1. (u,v) G 1Zd+k(i + 1). This condition is 
handled on line [3] before making a decision on bringing arcs 
up or down at the previous epoch. Indeed, the output stream 
at time i can only be written to after reading the input 
streams up to time i + 1. 

Condition 2. (u,v) G 1Z m -k(i + d + k). This condition 
means that down events in C m -k must be delayed for one 
epoch before lowering an arc's counter. This accounts for 
the local delayedk lists that are processed on lines |16| to |19| 

Condition 3. 3w, (u,w) G Cd+k(t) and (w,v) G C m -k(t + 
d). This condition is checked by maintaining for each k 
two adjacency lists adjd+k and adj m -k and checking upon 



ALGORITHM 1: Lower bound composition 

Require: {Cd+k > £m— fc} II T pairs of input streams 
Ensure: Cd+ m // the output stream 

Local: {adjd+k> a dj m -k} t II T pairs of adjacency matrices 
Local: counter[] // arc counter 

Local: delayed^\\ // r lists of delayed down events 
for i <- to T do 

for k 4— to r — 1 do 

forall the {a, UP} 6 .C^+fe!*] do counter[a\-\ — \- 

forall the new arcs a do 

append {a, UP} to Cd+ m [i — 1] 

forall the arcs a s.t. counter[a] = do 
append {a, DOWN} to C d+m [i - 1} 

for k 4— to r — 1 do 

forall the {(«,«), CP} £ £d+fc[i] do 
add (u,v) to adjd+k 

forall the £ adj m _ j. do counter [(u, «))]H — h 

forall the {(«,«), CP} £ £ m _ fe [i + d + Ai] do 
counter[(u, v)]-i — h 
add (u, «) to adj m _i c 

forall the (w,u) £ adjd+k do cou?iter[(?i), u)]++ 

forall the a £ delayed], do counter[k] — 
delayed^ <— 

forall the {(u, «), DOVWV} £ C m - k [i + d + k] do 
add (m, t;) to delayed^ 
remove (u,v) from adjr m -fc 

forall the (w,u) £ adjd+k do coimter[(ui, ti)] — 

forall the {(u,v), DOWN} £ £ d+fc [i] do 
counter[(u, v)] — 
remove (u, v) from adjd+k 

forall the (v,w) £ adj m _ j. do counter[(u, w)] — 



UP/ DOWN events whether the end of an arc in adjd+k corre- 
sponds to the origin of an arc in adj m -k (lines [lO)ll| |14j[T5| 
20pl] and |24|25p . 



Table 2: Dataset characteristics. N is the number of 
vertices, M the number of edge UP/DOWN events, 
T the duration, P the beaconing period, and r\ the 
time resolution. 



Algorithm [T] is then used as a building block to implement 
the ffi addition of lower bound families. Starting from £Vs 
family, for any n £ N*, £„ T is obtained in log(n) applications 
of the © operation using a binary exponentiation process. 

An implementation of Algorithm [l] fully integrated into a 
binary exponentiation algorithm over lower bound families, 
is available as a part of our dynamic trace library (DiTL |28|). 
This package also contains the code for transforming a lower 
bound TVG into an upper bound TVG, as well as the time- 
varying dominating set computation used in Section [5] 

4.3 Complexity analysis 

The worst-case memory requirements for this one-pass 
streaming algorithm are straightforward. Its local memory 
contains a non-sparse arc counter that requires up to 0(N 2 ) 
space. Furthermore, for each < k < r, it maintains two 
non-sparse adjacency matrices and an arc event list thereby 
requiring 0(N 2 ) space. Adding everything together yields a 
worst-case space complexity of 0(tN 2 ) that is independent 
of the duration of the trace. 

Before examining the worst-case time complexity, a word 
must be said about the implementation of the adjacency 
matrices (the arc counter is backed by an adjacency matrix). 
In our implementation, these are backed by per-vertex hash 
tables. Insertion and removal are therefore constant time 
operations but this approach may not scale to TVGs with 
much greater number of vertices than those considered in 
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this paper. In this analysis we will consider that insertion 
and removal cost 0(log(7V)) (e.g., by using binary trees). 

At each epoch, arcs are brought up and/or down. For 
each < k < r, we note Md+k and M m -k the number of 
UP and DOWN events in the entire TVGs Cd+k and C m -k, 
respectively. Let M = maxt, {max{Md+fc, M m _fe}} be the 
maximum number of events in all the involved TVGs. In the 
worst case scenario where all possible arcs are updated at each 
epoch, we have M = 0(TN 2 ) events, where T is the number 
of epochs. Each processed event costs a modification of the 
arc counter (0(log(A jr ))), an adjacency matrix (0(\og(N)), 
and up to iV counters for the composed paths (0(N log(iV))). 
All of these must be performed for the r pairs of input TRGs. 
Adding everything together yields a worst-case time com- 
plexity of O (TMNlog(N)), or O (tTN 3 log(iV)) in terms of 
iV exclusively. In practice however M <C TN 2 , so the first 
formulation is more accurate. 

5. APPLICATIONS 

Having formalized reachability graphs and detailed a method 
for efficiently computing upper and lower approximations of 
them in the previous sections, we now study the reachability 
graphs of several synthetic and real-life traces for a variety 
of edge traversal times r and maximum delays 5. In particu- 
lar, we show how the properties of these reachability graphs 
place bounds on communication capabilities and highlight 
the asymmetric nature of dynamic networks. 

5.1 Datasets and metrics 

While we have calculated reachability graphs on many 
publicly available datasets, in this paper we present results 
based on two real-life contact traces selected for their short 
beaconing periods: 

Stanford 1 23 1 . As part of an epidemiology study, this trace 
captures face-to-face contacts among all students, teachers, 
and staff in a US high school between 7 a.m. and 4 p.m. 
The 782 ZigBee motes (TelosB Crossbow) sent beacons every 
20 seconds (sending times are synchronized in the published 
trace) . 

Rollernet 25 1 . Opportunistic sighting of Bluetooth devices 
by groups of rollerbladers carrying Intel iMotes during a roller 
tour. The 62 iMotes performed neighborhood scans every 15 
seconds. 

For comparison purposes, we also provide results based on 
two well understood synthetic mobility models: 

Random- Waypoint [18| . We simulated 50 nodes with 
speeds between 3 and 7 m/s from the stationary state in a 
1,000 x 500m 2 rectangle sending beacons every second with 
a 20 m transmission range. 



Table 3: Metrics on ^-regular reachability graphs of 
duration T. Xu is the value of X during the k epoch, 
i.e., X k = X({k+\)ri). 
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Community Model |19| . We simulated 50 nodes using 
the same parameters as above with 8 communities. In the 
community model, nodes with stronger social ties are more 
likely to be in geographic proximity. 

The characteristics of these four ^-regular traces are sum- 
marized in Table [2] 

For each reachability graph, we compute its time-varying 
dominating set (TVDS), i.e., a mutable set such that, at all 
times, there exists an incoming arc to any vertex in the TVG 
from a member of the TVDS. Thus, at any given time t, the 
TVDS is a traditional dominating set on the directed graph 
at time t. We note this TVDS T>s if it is derived from a 
reachability graph IZg. 

Even on static graphs, calculating a minimal dominating 
set is a classic NP-complete problem. Here we adapt the 
well-known greedy algorithm for choosing multipoint relays 
for broadcasting in a wireless network to the time-varying 
context [17| . The size of the dominating set calculated by 
this algorithm is within a factor log(iV) of the optimal, where 
N is the number of nodes. Our TVDS calculation algorithm 
reacts to arc UP/DOWN events as follows. 

1. If, after the arc event is processed, the previous domi- 
nating set is no longer a dominating set, then proceed 
to step [2] Otherwise do nothing. 

2. Iteratively build the new dominating set, starting from 
an empty set, by greedily adding the node with the 
greatest outgoing degree to nodes not yet covered by the 
new dominating set. If two nodes have equal outgoing 
degrees, pick the one that is in the previous dominating 
set. 

The time- varying dominating set thus calculated attempts to 
be both reasonably stable over time and close the optimal. 

The rest of this section focuses on the following metrics. 
Their formulas and average values are found in Table [3] 

• Density. The ratio of the number of arcs in a reacha- 
bility graph at time t over the total number of possible 
arcs. 

• Dominating set size. The normalized number of 
vertices in the dominating set at time t. 

• Asymmetry. Here we no longer consider directed 
arcs but undirected pairs of vertices. At time t, the 
asymmetry is the ratio of asymmetric pairs among pairs 
that have at least one edge between them. 

5.2 Algorithm accuracy 

The approximation algorithm detailed in Section [4] is ex- 
tremely accurate. Indeed, thanks to Theorem [5] and Defini- 
tion [8] the approximation calculates the exact value for all 
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Figure 6: The upper and lower bounds give nearly 
identical values of density and dominating set size. 
Example taken from the Rollernet trace with r=5s 
and <5=lmin. 




(a) 5 = 20min (b) S = 40min 



Figure 7: Subset of Stanford's 20 and 40-minutes 
reachability graphs during morning classes (r = 20s). 
Dark blue triangles are teachers; circles are stu- 
dents. The dark red arrows represent asymmetric 
arcs. The classroom structure is clearly visible. 

times t G Furthermore, the upper and lower bounds 
are nearly identical during epochs (t ^ jyN). Fig. [6] plots the 
density over time for both the upper and lower bounds of 
the Rollernet reachability graph for r = 5s and 8 = lmin. 
The plot has been zoomed in to show only a small span 
of the y-axis for 30 seconds. The upper and lower bounds 
on density are nearly equal at all times. So are the values 
of the dominating set size computed from the upper and 
lower approximations that only disagree in the circled area 
on Fig. [6] These observations hold for all the reachability 
graphs computed in this paper. Indeed, looking at the values 
of average density and average dominating set of the over 
5,000 pairs of upper/lower bound TVGs calculated in this 
paper, the maximum disagreement is 8.10~ 3 for the former 
and 4.10 -2 for the latter. In practice, the difference is smaller 
than the width of the line in plots. Therefore, in the rest 
of this paper, we only plot the values based on the lower 
approximation. 

5.3 Revealing temporal structural properties 

The dynamics of reachability graphs highlight temporal 
structural properties of the original connectivity trace that 
are not otherwise accessible. Fig. [7] shows snapshots of a 
subset of the reachability graph in the high school network 
captured in the Stanford dataset. When the delay is 20 
minutes, the classroom structure of the trace is clearly visible. 
In the original contact trace, edges within a classroom are 
unstable and lead to merges and splits of small connected 
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Figure 9: Proportion of connected pairs of vertices over time for two real-life datasets. 
divided into symmetric pairs (yellow) and asymmetric pairs (dark red). 
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Figure 8: Snapshots of rollernet's 10 and 60-seconds 
reachability graphs after 20 minutes (r = 5s). The 
rollerblading tour is moving from left to right. The 
dark red arrows represent asymmetric arcs. The 
asymmetry is caused by the acceleration phase in 
the accordion phenomenon [25] . 



components. In the reachability graph, a classroom is a stable 
complete subgraph with one teacher. A group of interacting 
teachers are also visible at the top of Fig. |7a| When some 
teachers and students later change classrooms, they create 
strong asymmetries in higher-delay reachability graphs such 
as the 40-minute one depicted in Fig. |7b| 

Fig. [8] shows snapshots of the reachability graph captured 
in the Rollernet dataset. These correspond to an acceleration 
phase, where the head of the tour (right on Fig.[8| pulls ahead 
of the rest of the rollerbladers. Due to the accordion phe- 
nomenon, the tail of the tour does not react immediately [25] . 
This completely prevents short delay communications be- 
tween these two groups (S = 10s on Fig. 8a I. With a longer 
delay (e.g., <5 = 60s on Fig. |8b[), backward communications 



towards the rear of the tour become possible (in particular 
through organizers who stop on the side of the road and let 
the tour pass them), thereby creating strong asymmetry in 
the reachability graph. 

A more systematic study of dynamic properties over time 
is shown on Fig. [9] It plots both the proportion of connected 
pairs of vertices (left axis) and the size of the dominating set 
(right axis). The pairs of vertices are further divided into 
symmetric pairs (yellow) and asymmetric pairs (red). At a 



given time, if the red histogram reaches 1 then a journey 
exists in at least one direction between all pairs of nodes. If 
the yellow histogram reaches 1, then a journey exist in both 
direction between all pairs of nodes. 

For a 1-minute delay, the Rollernet reachability graph alter- 
nates, sometimes very rapidly, between fully connected states 
and highly asymmetric partially connected states (Fig. |9a[ ). 
Therefore, any opportunistic communication system aiming 
for latencies under a minute will be strongly impacted by 
the accordion phenomenon. However, if a communication 
system can tolerate up to three minute delays, then it should 
be possible to smooth out the dynamic mobility. Indeed, the 
3-minute reachability graph is fully connected for the entire 
duration of the trace, and the size of its dominating set is 
almost always equal to one. 

The Stanford trace alternates static phases in classroom 
(the valleys on Fig. 9b I and dynamic phases moving between 
classrooms or around the food court (the peaks on Fig |9b[ ). 
The progressive "shrinking" of the valleys illustrates how 
reachability graphs "grow backwards" with increasing delays. 
Indeed, if there exist a journey from a to 6 within 20 minutes 
at time t, then there exists a journey from a to & within 40 
minutes at time t minus 20 minutes. Of course, after the 
increase in delays exceeds the width of the valleys, the reach- 
ability graph eventually reaches it maximum density. Note 
that in this case, an incompressible amount of asymmetry 
subsists throughout the trace. 

Due to space constraints, we have only shown, in this 
section, results for specific values of r but, as a general rule, 
the smaller the value of r, the faster the TRG becomes a 
clique. When analysing a network with a specific application 
in mind, the value of r can be set to a realistic value (e.g. 
message size over bit-rate), and the plots in this section give 
an immediate visual understanding of the communication 
possibilites. The next section examines quantitatively the 
importance the delay (S) and the edge traversal time (r) 
parameters. 

5.4 Bounds on communication capabilities 

Reachability graphs give straightforward bounds on com- 
munication capabilities. Indeed, the density at a given mo- 
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Figure 10: Average density vs. maximum delay S for different edge traversal times r (in seconds). 
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Figure 11: Average dominating set size vs. maximum delay S for different values of r (in seconds). 



ment is exactly equal to the maximum delivery ratio ex- 
pectancy of a perfect opportunistic routing protocol whose 
delay-tolerance is equal to S and whose message size over 
bit-rate ratio is equal to r. Furthermore, the size of the 
dominating set indicates the achievable offload ratio in a 
scenario where the opportunistic network is assisting an in- 
frastructure (e.g., 3G) for disseminating content to all nodes 
in the network [12] . 

Given real- world system requirements, i.e., wireless bit-rate 
estimates, messages sizes, target delivery ratio, and delay- 
tolerance, reachability graphs provide an immediate answer 
to the following question: can an opportunistic network 
support this service? If not, can it effectively supplement an 
infrastructure in an offloading scheme? 

Fig. [l0]plots the average density against the delay tolerance 
for all datasets and for increasing edge traversal times, while 
Fig. [IT] plots the average dominating set size. Rollernet and 
Random Waypoint share similar characteristics. Both are 
very sensitive to increasing edge traversal times. When these 
are close to one second, near 100% density is achievable with 



11a 



a couple of minutes of delay tolerance (Figs. 10a 10c 
and |llc[ ). In this case, they can support pure opportunistic 
communications. However, when edge traversal times are 
longer, tight delay constraints are impossible unless as part 
of an offloading scenario (e.g., r = 10s and S = 60s for 
Rollernet, Fig 10a l. For r = 20s, Random Waypoint cannot 



even provide offloading for reasonable delays (Fig. |llc| ). 

Similarly, Stanford and Community share similar features. 
Regardless of the delay-tolerance and message size, no pure 
opportunistic routing protoc ol ca n pr ovide anything near 
100% delivery ratio H (Figs. |l0b| and |l0d|. Despite this, 



they are both good offloading scenarios as the size of their 
dominating sets is consistently below 20% of the total number 



of nodes, thereby offering potential offload ratios of around 
80%. Indeed, in order to disseminate content to the entire 
network, pushing one copy per classroom in the Stanford 
case, or one copy per community in the Community case, 
plus copies to single nodes is an obvious strategy. However, 
for larger values of r (e.g., 20), Community is no longer able 
to offload content with reasonable delays (Fig. |lld|. 



5.5 Asymmetry 

Asymmetric communications are a fundamental aspect of 
opportunistic networks. Too strong focus on inter-contact 
times may lead to overlook asymmetry, but reachability 
graphs provide a natural way of studying and quantifying it. 

Fig.[l2]plots for all datasets the average asymmetry against 
the maximum delay for increasing values of the edge traversal 
time. The asymmetry for the 1Z T graph directly derived from 
the original contact trace is always as it only contains 
symmetric arcs (see Section 3.4|. Asymmetry in Random 
Waypoint follows a regular pattern: a bell-shaped curve 
of constant max value, whose width and center increase 



x The r = results for Stanford are an artifact of the 20- 
second resolution. 



with r (Fig. 12c I. In this scenario, depending on the delay 
constraint, up to 80% on average of connected pairs of vertices 
can only communicate in one direction. This is due to 
nodes traveling long straight distances creating asymmetric 
reachability to nodes they meet from nodes they had met 
earlier. As the maximum delay increases, return journeys 
using different intermediate nodes appear, the reachability 
becomes complete, and the asymmetry returns to 0. 

As previously, Rollernet and Random Waypoint show sim- 
ilar behavior. However, for Rollernet the maximum asymme- 
try also increases with r (Fig. |12a[ ). Indeed, as fewer edges 
are traversable, journeys become less likely between the front 
and the rear of the rollerblading tour. When such a journey 
is possible, it creates a longer-lasting asymmetry. In the 
Stanford trace, asymmetry follows the same increasing bell 
shape as in Rollernet, but never returns to (Fig. |12b|. In 
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Figure 12: Average asymmetry vs. maximum delay 8 for different values of r (in seconds) 
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fact, this incompressible minimum asymmetry also increases 
with r. 

6. RELATED WORK 

Adaptations of traditional static graph distance metrics 
and algorithms to time- varying graphs have yielded many dif- 
ferent concepts. For example, Orda et al. propose a shortest 
path algorithm for TVGs based on different waiting policies 
(unrestricted, forbidden, and source waiting) [2l]. Our work 
corresponds to the unrestricted policy, in which a message 
may wait for an unlimited amount of time anywhere along 
its path through the TVG. Bui Xuan et al. have proposed 
efficient algorithms for calculating shortest (in number of 
hops), fastest (in path traversal time), and foremost (i.e., 
earliest arrival) paths in TVGs 29 . All these algorithms are 
designed to compute the shortest paths to all destination 
from a source and a fixed starting time. Our algorithms, in 
contrast, computes the reachibility graph by estimating the 
shortest paths for all possible starting times. 

Several approaches to reachability in time- varying graphs 
exists. For strictly positive edge traversal times, a simple 
heuristic consists in dividing time into successive slots of 
length r and keeping only edges that are persistently present 
during each slot [8]. This provides a good lower-bound 
approximation for small values of r (i.e., less than n) whereas 
our approach can handle arbitrary edge traversal times. From 
a given starting time t, reachability among all pairs of nodes 
can be calculated by iterating over all edge UP/DOWN 
events [24]. This calculation can then be repeated for a 
sample of starting times [14) . This approach yields static 
reachability graphs for a discrete sequence of starting times, 
whereas the temporal reachability graphs defined in this 
paper calculate reachability in continuous time. 

Chaintreau et al, in their work on the diameter of op- 
portunistic networks, calculate a Last Departure / Earliest 
Arrival structure for each pair of nodes [5] . This structure 
can tell for any pair of nodes (A,B), at any time t, when 
is the earliest arrival for a message leaving A for B at time 
t. A reachability graph could be easily derived from these 
structures, but would unfortunately only cover the r = 
case. Our approach is more general since it can also handle 
non-zero edge traversal times. 

7. CONCLUSION AND FURTHER WORK 

In this paper, we introduced the notion of temporal reach- 
ability graphs. Given an edge traversal time and a maximum 
journey delay, temporal reachability graphs capture the tem- 
poral connectivity of the time-varying graphs they derive 



from. After formalizing the concept, we proved that regu- 
lar reachability graphs, which encompass all experimental 
datasets, can be composed to compute reachability graphs 
of higher maximum delay with very high accuracy. Fur- 
thermore, we proposed a scalable highly-parallel streaming 
algorithm for their efficient computation. By applying this 
algorithm to synthetic and real-life contact traces, we showed 
how reachability graphs provide fresh new insights on tempo- 
ral connectivity in time-varying graphs. In particular, they 
yield an immediate and intuitive picture of the communi- 
cation capabilities and offloading potential of opportunistic 
networks. 

This work on reachability graphs will be pursued in several 
directions. Firstly, our results could be extended to the 
more general case where the edge traversal time r is not 
constant but may take values among multiples of a TVG's 
resolution r\. Secondly, as seen in this paper, reachability 
graphs seem to reveal community structures that are not 
immediately apparent in the contact traces. They could 
therefore lead to new approaches for the difficult problem 
of community detection in time-varying graphs. Finally, 
much work remains to be done on the statistical analysis and 
modeling of reachability graphs. For example, what are the 
correlations between symmetric and asymmetric arcs from a 
given node? How do degree and inter-arc time distributions 
evolve with edge traversal time and delay? If reachability 
graphs turn out to be easier to model than their underlying 
time-varying graphs, could they be used as a first step for 
realistic synthetic connectivity models? 
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APPENDIX 

A. PROOFS 

A.l Proof of Theorem ffl 

Lemma 1. Let TZs be a reachability graph of a TVG Q 
with edge traversal time r. Let J be a journey from u to 
v such that departure(J) > t and arrival(J') < t + 8 + r . 
Then 30 < e < 1 such that (u,v) £ lZs+ eT (t). 

Proof. If arrival^) < t + S, then we set e = and we 
have (it, v) £ 7Zs(t). 

If arrival(J') > t + S, let < e < 1 such that arrival(J') — 
t + 8 + er. In this case, (w, «) G 72.,s+ eT (t) . □ 

Lemma 2. Let IZ^ be a reachability graph of a TVG Q 
with edge traversal time t. Let J be a journey from u to v 
such that departure(J') > t and arrival(J) < t + /x. T/ien 
fi> r and 30 < e < 1 suc/i that (u, v) £ 7£ M - eT (t + er). 

Proof. The temporal length of journey J is 8j < [M. 
Since it is a valid journey from u to v, Sj > t, hence fi > r. 

If departure(J') > t + r, then Sj<ji — T, and V0 < e < 1, 
(w, w) G 7£ M - r (t + r) C TZ„- cr (t + er). 

If departure(J') < t + t, then we set < e < 1 such that 
departure(J') = t + £7". Then, since arrival(J') < t -f- /j., 
5 j- < /i — er and (u, u) G 72. M _ E ,-(t + er). □ 

We can now prove Theorem [TJ 

Proof. (Theorem [TJ First, let us show that if an arc fits 
one of the decompositions then it belongs to 7Zs+f_i- For all 
times t, let (u, v) be an arc in (TZs+<lt ® TZ^-^t) (t) (0 < e < 
1). By definition, one of the following conditions holds. 

1. (u, v) G TZs+c T {t). Therefore (u, v) G TLs+^t) (Proposi- 
tion [TJ. 

2. (it, v) G TZfj,- iT (t + 8 + er). In this case, there exists a 
journey J in Q such that departure{J) >t + S + er>t 
and arrival{J) < t + S + er + fi — er < i + 8 + /i. 
Therefore, by definition, (it, tj) G IZs+^t). 

3. 3iu G V, s.t.(u, w) G 7?.i+ ET (t) and (u;, v) G 7?. M _ ET (t + 
5 + er). Let 7 and JT' be the respective journeys from 
u to w and from w to v. J U J' is a journey from it to 
v such that departure^ U J') = departure(J') > t and 
arrival^ U J) — arrival(J') < t + S + er + ji — er, i.e. 
arrival(J U J) < f + <5 + /i. Therefore («,«) G TZs+^it). 

Conversely, let us show that for any time t and any 
arc (it, v) G 7Zs+fi(t), there exists < e < 1, such that 

(U,V) G (ns+er^n^-er)^). Let J = {(d, tl), . . . , (e fe , t k )} 

be the journey from u to tj such that departure^) > t and 
arrival{J) < t + 8 + /x. 

Let i = max{l < j < k\tj < t + 8}. There are three possi- 
ble situations. 

1. i is not defined, i.e., departure(J) > t + 8. Since 
arrival{J) < t + 5 + /i, 30 < e < 1, such that (u, v) G 
Kp,- eT (t + 8 + er) (Lemma[2|. 

2. t^ = tk- Then J is a path from u to v such that 
departure(J') > t and arrival(J') < t + S + r. Therefore 
30 < e < 1, such that (it, v) G lZs+ eT (t) (Lemma [TJ. 

3. t\ < ti < ti+i < tk- In this case, we can divide the jour- 
ney J from it to v at time t into the sub-journeys J\ — 

{(ei,ti), . . . , (ei,ti)} andj 2 = {(e i+ i, £ i+ i), . . . , (e k ,t k )}- 
We set w = to(ei) = from(ei+i). We have arrival(Ji) = 



t t +r<t + 5 + r. Therefore, 30 < e < 1 such that 
(it, w) G T^+et^) (Lemma[TJ). Since J is a valid jour- 
ney in Q, ti+i > ii+r > t + 5 + er. Therefore 
departure(yj-z) > t+5+er, and (w,!;) G 7?, (J _ eT (t+5+er). 



In all three cases, (u, v) G (TZs+ CT 

A.2 Proof of Theorem [2] 



i^_ er )(t). □ 



Lemma 3 (One-hop journeys). Let Q be an n-regular 
TVG whose edge traversal time is r G r/W . For k G N, if a 
one-hop journey {(e,t)} exists in Q such that kn < t < (k + 
l)n, then all one-hop journeys {(e, t)} with kr\ < t < (k + l)ri 
also exist in Q . 

PROOF. Let r = nrj with n G N*. If J = {(e,t)} is a 
valid journey in t, then by definition for all t < C < t + r, 
e G 5(C)- Since kn < t < (k + l)n and 5 is 77-regular, then for 
all kn < C < (fc + "-)??, e G 5(C) (Definition^. Furthermore, 
since arrival(J') = t + r > (A: + 11)77, there exists < e < 1 
such that (k + n)n < (fc + n + e)n < (k + n + 1)77. In this case, 
e £ Q ((k + 11 + e)n). Therefore, since Q is r)-regular, for all 
{k+n)n < C < (fc+n+l)T7, e G 5(C) (Definition[7|. Finally let 
J = {(e, t)} be a one-hop journey in Q with krj < t < (A; + 1)77. 
We have departure(J) > ^17 and arrival(J') < (fe + 1 +n)?7. 
Since for all fci/ < C < (k + 1 + 11)77, e £ 5(C)i ^7 is a valid 
journey in Q. □ 



Lemma 4 (Epoch inclusion). Let Q be an r)-regular 
TVG whose edge traversal time is r G nW . For 8 £ J7N, let 
TZs be a reachability graph of Q . For k £ N, let times t a and 
tb be such that krj < t a < (k + l)n and krj < tb < (fc + l)i?. 
T/ien ft^ta) C 7i 5 (4). 

Proof. For <5 = 0, TZs is always empty and the lemma 
is trivially true. Hereafter we write 8 = dn, with d £ N*. 
If an arc (u, v) is in 7Zs(t a ), then there exists a journey 
,7 = {(ei, ti), . . . , (ek,tk)} from u to v such that t\ > t a and 

+ r < i a + 8. 

First we assume that t a < tb. 
Let i = minlj'lt, > tb + (j — l) 7 "}- 

If i is not defined then Vj, f a + (j — l)r < i, < tb + (j — l)r. 
We consider the journey J = { (e^, tj '} 1< - <fe , such that tj = 
tf, + (j — l)r. By construction, the departure times of all of 



its one-hop journeys verify tj 



> r. Furthermore, since 



kfj + (j — 1)717; < tj < tj < (k + 1)77 + (j — 1)7117, an one-hop 



journeys {(ej,tj)} are valid (Lemma|3J). Therefore j" is a 
valid journey from it to tj such that departure(j') > tb and 
arrival(J) <t b + 8, i.e., (it, v) £ 7Zs(t b ). 

If i is defined then there are two possibilities. If i = 1 
then departure^) > t(, and (11,71) is trivially in 1Zs(tb). If 
i > 1, then we can divide journey /J into two subjourneys 



0<i<i 



and J2 = {(e^tj)} 



i<j<k 



As above, we 



Ji={(e,-,t,)} 

can transform J\ into a valid journey J\ = {( e j^j} 0<J< ^ 
with tj = tb + (j — l)r such that departure(Ji) > ti, and 
arrival(Ji) < tb + (i— l)r. Since departure{ J-z) > tb + (i—l)r, 
J\ U ^2 is a valid journey from 11 to tj that leaves after tj 
and arrives before tb + 8. Therefore (u,v) £ 7Zs(tb). 

Conversely, let us assume that t a > tt- Let i — max { j\t 3 ■ + r < tb + 8 
If i is not defined, then Vj, tb + 8 + (k — j')r < tj + r < 
t a + 5 + (fc — j)r. We consider the journey J = { (e } ■, tj } 1<J<fe , 
such that tj = t a + 8 + (k — j — l)r. By construction, 



the departure times of all of its one-hop journeys verify 
tj—tj-i > t. Furthermore, since (k-\-d)n+(k—j — l)nn < tj < 
tj < (k + d+l)r)+(k — j — l)nri, all one-hop journeys {(ej,tj)} 
are valid (Lemma |3p - Therefore J is a valid journey from u 
to v such that departure(j) > tj, and arrival(j') <tb + d, 
i.e., (u, w) G Hs(tb). 

If i is defined then there are two possibilities. If i = fc then 
arrival(J') < tb + 5 and (w, w) is trivially in lZg(tb). If i < fc, 
then we can divide journey into two subjourneys Ji = 
{(ej,ij)} < f <i and = {(e,, ij)} i<3 < & . As above, we can 
transform J2 into a valid journey = {( e j>^j'}j <3 <fc with 

= tb+<5+(fc — j — l)r such that departure(J2) >tb+5+(k — 
z)t and arrival(J2) < i& + <$. Since arrival(Ji) < tb + (k — i)r , 
Ji U is a valid journey from w to w that leaves after tb 
and arrives before tb + 5. Therefore (u, v) G lZs(tb). □ 

We can now prove Theorem [2] 

Proof. For S = 0, 1Z& is always empty and the lemma 
is trivially true. Hereafter 5 G nW . For all k G N, and 
for all kn <U < (k + 1)77, 7£«(ti) C ^(fcr?) and 7£f (*],) C 
72-5 ((fe + 1)77)) (Lemma GJ. For times t 2 such that fc?7 < 
*2 < (fc + we have lZs(ti) Q Hsfa) and Ttgfa) Q 
1Zg(ti) (Lemma[4j. Therefore lZg(ti) = Tlgfa) and 7?.^ is 
^-regular. □ 

A.3 Proof of Theorem g] 

Lemma 5 (Journeys in regular graphs). Let Q be 
an n-regular TVG whose edge traversal time is r G 77N*. 
Let J be a journey in Q from u to v such that 3(a,b) G 
NxN, departure(J') > an and arrival(J') < (b + l)n. Then 
there exists another journey J from u to v that verifies 
departure(J) > an and arrival(J') < bn. 

PROOF. We write r = nrj with n G N*. If b < a + n, 
then arrival(J') — departure^) < r which is impossible. 
Therefore b > a + n. 

J = {(ei, £1), . . . , (ek, tk)} is a journey from utou. For all 
i, we note Ci the integer such that dn < tj < (<3j + 1)77. Here 
a = ci and 6 = Cfc+n. Indeed, we have arrival(J') = tk+r < 
(c fe + n + 1)7/. Then we define J = {(ei, C177), . . . , (e k , c k n)}. 

Firstly, since J is a valid journey, for all tj < t < tj + 
t, ej G <7(t). Because C/ is 77-regular, this implies that for all 
Ci?7 < t < Ci + t, ei G Q(t), and therefore all one-hop journeys 
{(ej,Ci?7)} in J are valid. 

Secondly, for all i, h+i—ti > r leads to (cj+i — Ci)n + n > r, 
i.e., (ci+i — Ci) + 1 > n. Since these are all integers, we get 
Cj+i —Ci>n and all the one-hop journeys in J may be taken 
successively and J is therefore a valid journey from u to v 
such that departure(J') > an and arrival(J') < (ck + n)n = 
bn. □ 



We can now prove Theorem [3] 

Proof. We write r = nrj with n G N*. At time t = an, 
let (tj, tj) be an arc in 7t(d+k)r) ® T^-(m-fc)?) with < fe < n. 
By setting e = k/n, (u,v) G 1Z( d+m ) v (an) with < e < 1 
(Theorem [TJ. 

Conversely, let us consider an (u,v) G TZ^d+m)^^). Since 
77177 > r, according to Theorem [TJ there exists < e < 1 such 
that one of the following holds (we note k the integer such 
that kn < enn < (k + 1)77). 

• (u,v) G U dv+CT (ari). Therefore (u,v) G 1Z {d+k ) v {an) 
(Lemma [Hj. 

• (w, «) G 1Z mv - eT ((a + d)n + er). With the inclusion 
rule from Proposition [Tj (u, v) belongs to 

72-( m -fc)t) ((a + d)?7 + enn). Furthermore, since lZ( m -k) n 
is ?7-regular (Theorem[2j, (u, v) G TZ( m -k) v ((a + d + k)n). 

• 3k; G V, (u,w) G Hd v +e T (an) and 

(ii>,v) G Ttm-q-tT {(a + d)n + er). Following the same 
reasoning as for the first two conditions, we show that 
3w G V, (u, w) G Tlfd+k^iari) and (w, v) G TZ( m ^ k )r, ((a + d + k)rj). 
In all three cases, there exists < k < n such that («, v) G 

Tt{d+k)ri ® 7£(m-fc)?7- D 

A.4 Proof of Proposition [3] 

Proof. For c? < 2n, by definition £ d ^ = TJd,,. By in- 
duction, lets us suppose the proposition true until a certain 
I > 2n — 1, and show that it also holds for I + 1. Let (w, tj) 
be an arc in jCa + i) v (t). We write a the integer such that 
an < t < (a + l)n. If t = ar?, then y£ (;+1) ^(t) = Tl (l+1)rl (t) 
(Definition [8] and Theorem[3| and (u, v) G TZ(i+i) n (t). Here- 
after we assume that t > ar\. Since I + 1 > 2n, there exists 
(d,m) G iV such that n<d<l,n<m<l, and 2 + 1 = d + m. 
In this case, by definition, there exists < k < n such that 
one of the following holds. 

. (u, v) G C id+ k^ ((a + 1)77). By setting e = (a+1) ^ t+fc \ 
one can verify that 30 < e < 1, such that (m, tj) G 

7?-d, ; + e-r(t). 

• (u,v) G C {m - k) „ ((a + d + k)n). By setting e= t ~ a n ^ +fc " , 
one can verify that 30 < e < 1, such that (m, tj) G 
TZ mrt - €T (t + dn + er). 

• 3tjj G V, (tj, tjj) G Cdr,{t) and (w, v) G J0, mr ,(t + dn). By 
induction, (u,w) G TZd v (t) and (tjj,tj) G TZ m ri(t + drj). 

With Theorem [TJ in all three cases, (it, tj) G TZ^ + ij v . By 
induction, this shows that Vd G N, d, Cdr/ Q TZdr/- □ 



